# Multilingual vision-language

| Model | License | Description | Task | Publisher | Downloads | Likes |
| --- | --- | --- | --- | --- | --- | --- |
| Vit Gopt 16 SigLIP2 384 | Apache-2.0 | SigLIP 2 vision-language model trained on the WebLI dataset, supporting zero-shot image classification | Text-to-Image | timm | 1,953 | 1 |
| Vit SO400M 16 SigLIP2 512 | Apache-2.0 | SigLIP 2 vision-language model trained on the WebLI dataset, suitable for zero-shot image classification tasks | Text-to-Image | timm | 1,191 | 4 |
| Vit SO400M 16 SigLIP2 384 | Apache-2.0 | SigLIP 2 vision-language model trained on the WebLI dataset, supporting zero-shot image classification tasks | Text-to-Image | timm | 106.30k | 2 |
| Vit SO400M 16 SigLIP2 256 | Apache-2.0 | SigLIP 2 vision-language model trained on the WebLI dataset, supporting zero-shot image classification | Text-to-Image | timm | 998 | 0 |
| Vit SO400M 14 SigLIP2 378 | Apache-2.0 | SigLIP 2 vision-language model trained on the WebLI dataset, supporting zero-shot image classification tasks | Text-to-Image | timm | 1,596 | 1 |
| Vit L 16 SigLIP2 512 | Apache-2.0 | SigLIP 2 vision-language model trained on the WebLI dataset, supporting zero-shot image classification tasks | Text-to-Image | timm | 147 | 2 |
| Vit L 16 SigLIP2 256 | Apache-2.0 | SigLIP 2 vision-language model trained on the WebLI dataset, supporting zero-shot image classification | Text-to-Image | timm | 888 | 0 |
| Vit B 16 SigLIP2 512 | Apache-2.0 | SigLIP 2 vision-language model trained on the WebLI dataset, supporting zero-shot image classification tasks | Text-to-Image | timm | 1,442 | 1 |
| Vit B 16 SigLIP2 384 | Apache-2.0 | SigLIP 2 vision-language model trained on the WebLI dataset, suitable for zero-shot image classification tasks | Text-to-Image | timm | 1,497 | 0 |
| Vit B 32 SigLIP2 256 | Apache-2.0 | SigLIP 2 vision-language model trained on the WebLI dataset, supporting zero-shot image classification tasks | Text-to-Image | timm | 691 | 0 |
| Vit B 16 SigLIP2 256 | Apache-2.0 | SigLIP 2 vision-language model trained on the WebLI dataset, supporting zero-shot image classification tasks | Text-to-Image | timm | 10.32k | 4 |
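
Every entry above is described as supporting zero-shot image classification. As a rough illustration of what that means for a SigLIP-style model, the sketch below scores one image embedding against several text-prompt embeddings using a sigmoid over scaled cosine similarity, so each label gets an independent probability. Everything in it is an assumption for illustration: the function names are hypothetical, the three-dimensional embeddings are toy values rather than real model outputs, and `logit_scale` and `logit_bias` stand in for parameters that a real checkpoint learns during training.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def zero_shot_scores(image_emb, text_embs, logit_scale=10.0, logit_bias=-5.0):
    """Return one independent sigmoid score per text prompt.

    logit_scale and logit_bias are placeholder values (assumptions);
    in a trained model they are learned parameters.
    """
    img = l2_normalize(image_emb)
    scores = []
    for text_emb in text_embs:
        txt = l2_normalize(text_emb)
        cosine = sum(a * b for a, b in zip(img, txt))
        logit = logit_scale * cosine + logit_bias
        scores.append(1.0 / (1.0 + math.exp(-logit)))
    return scores


# Toy data (assumed): one prompt per candidate label, with hand-picked embeddings.
labels = ["a photo of a cat", "a photo of a dog", "a photo of a car"]
image_embedding = [0.9, 0.1, 0.0]
text_embeddings = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

scores = zero_shot_scores(image_embedding, text_embeddings)
best_label = labels[max(range(len(labels)), key=scores.__getitem__)]
print(best_label, [round(s, 3) for s in scores])
```

Because each score comes from its own sigmoid rather than a softmax across labels, the scores need not sum to 1. In practice the embeddings would come from the image and text encoders of one of the listed checkpoints rather than being written by hand.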
© 2025 AIbase